Tibetan speech synthesis based on an improved neural network
نویسندگان
چکیده
Nowadays, Tibetan speech synthesis based on neural network has become the mainstream method. Among them, griffin-lim vocoder is widely used in because of its relatively simple synthesis.Aiming at problem low fidelity vocoder, this paper uses WaveNet instead for synthesis.This first convolution operation and attention mechanism to extract sequence features.And then linear projection feature amplification module predict mel spectrogram.Finally, use synthesize waveform. Experimental data shows that our model a better performance synthesis.
منابع مشابه
AN IMPROVED CONTROLLED CHAOTIC NEURAL NETWORK FOR PATTERN RECOGNITION
A sigmoid function is necessary for creation a chaotic neural network (CNN). In this paper, a new function for CNN is proposed that it can increase the speed of convergence. In the proposed method, we use a novel signal for controlling chaos. Both the theory analysis and computer simulation results show that the performance of CNN can be improved remarkably by using our method. By means of this...
متن کاملDeep neural network-based statistical parametric speech synthesis system using improved time-frequency trajectory excitation model
This paper proposes a deep neural network (DNN)-based statistical parametric speech synthesis system using an improved time-frequency trajectory excitation (ITFTE) model. The ITFTE model, which efficiently reduces the parametric redundancy of a TFTE model, improved the perceptual quality of the vocoding process and the estimation accuracy of the training process. However, there remain problems ...
متن کاملAn Improved BP Neural Network Algorithm Based on Factor Analysis
Back-Propagation (BP) neural network, as one of the most mature and most widespread algorithms, has the ability of large scale computing and has unique advantages when dealing with nonlinear high dimensional data. But when we manipulate high dimensional data with BP neural network, many feature variables provide enough information, but too many network inputs go against designing of the hidden-...
متن کاملTraffic Prediction Based on Improved Neural Network
Artificial neural networks and genetic algorithms derived from the corresponding simulation of biology, anatomy. The paper analyzes the advantages and the disadvantages of the artificial neural networks and genetic algorithms. The artificial neural networks and genetic algorithms to be combine in the prediction model. This method is used to predict traffic volume in a road, the accuracy of fore...
متن کاملPhone-based speech synthesis with neural network and articulatory control
This paper presents a novel method for synthesizing speech signal using a phone-based concatenation approach. Neural network is employed for the generalization of the phone templates during synthesis. Simpli ed articulatory space input parameters based on a modi ed vowel diagram are used to provide exible and e ective articulatory control. It also enables the design of an articulatory control m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: MATEC web of conferences
سال: 2021
ISSN: ['2261-236X', '2274-7214']
DOI: https://doi.org/10.1051/matecconf/202133606012